On comparing and combining intra-speaker variability compensation and unsupervised model adaptation in speaker verification
نویسندگان
چکیده
In this paper an unsupervised intra-speaker variability compensation method, ISVC, and unsupervised model adaptation are tested to address the problem of limited enrolling data in text-dependent speaker verification. In contrast to model adaptation methods, ISVC is memoryless with respect to previous verification attempts. As shown here, unsupervised model adaptation can lead to substantial improvements in EER but is highly dependent on the sequence of client/impostor verification events. In adverse scenarios, unsupervised model adaptation might even provide reductions in verification accuracy when compared with the baseline system. In those cases, ISVC may outperform adaptation schemes. It is worth emphasizing that ISVC and unsupervised model adaptation are compatible and the combination of both methods always improves the performance of model adaptation. The combination of both schemes can lead to improvements in EER as high as 34%.
منابع مشابه
Unsupervised intra-speaker variability compensation based on Gestalt and model adaptation in speaker verification with telephone speech
In this paper an unsupervised compensation method based on Gestalt, ISVC, is proposed to address the problem of limited enrolling data and noise robustness in text-dependent speaker verification (SV). Reductions in EER and in the integral below the ROC curve as high as 20% or 40% and 30% or 60%, respectively, can be achieved by ISVC independently of the number of enrolling utterances. In contra...
متن کاملUnsupervised Compensation of Intra-Session Intra-Speaker Variability for Speaker Diarization
This paper presents a novel framework for unsupervised compensation of intra-session intra-speaker variability in the context of speaker diarization. Audio files are parameterized by sequences of GMM-supervectors representing overlapping short segments of speech. Session-dependent intra-session intra-speaker variability is estimated in an unsupervised manner, and is compensated using the nuisan...
متن کاملSeparating speaker and environment variabilities for improved recognition in non-stationary conditions
In this paper we address the problem of speaker adaptation in noisy environments. We estimate speaker adapted models from noisy data by combining unsupervised speaker adaptation with noise compensation. We aim at using the resulting speaker adapted models in environments that differ from the adaptation environment, without a significant loss in performance. The key idea is to separate speaker a...
متن کاملJoint Environment and Speaker Adaptation
In this paper we address the problem of speaker adaptation in noisy environments. We aim at estimating speaker adapted models from noisy data by combining unsupervised speaker adaptation with model-based noise compensation. Speaker adapted models obtained with this method should contain as little information about the environment as possible, so that they can be reused in different environments...
متن کاملIntra-speaker variability compensation in speaker verification with limited enrolling data
In this paper a compensation method is proposed to address the problem of limited enrolling data in speaker verification. Instead of adapting the client HMM, the technique presented here modifies the verification speech signals by maximizing the a posteriori p.d.f. in order to optimize the reduction in intra-speaker variability. The proposed approach can lead to reductions of 38.9% and 61.8% in...
متن کامل